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Safe languages 


e Enforce safety at compile and run time 
Type safe: values used only at their declared type 


© Memory safe: programs write only in their own (allocated) 
memory 


® No undefined behavior 

© Examples: C#, Java, Modula-3, Cedar/Mesa 
e Benefits 

© Higher programmer productivity 


© Detect errors at compile or run time leading to common 
security attacks 


© Buffer overruns, stack smashing, double de-allocate 
® More reliable programs 


Native code 


e Machine code that runs directly on a 
processor 


e Example: x86 code 
e Typically used for: 
e OS-level programming (Windows) 
e Commercial desktop applications (Office) 


e Webserver/browser implementations 
(Internet) 


The problem 


@ Safe languages not usable for native code 
development 


e Safe programs compiled to bytecode 


e Bytecode runs within a runtime system (CLR 
or JVM) 


@ How can we use safe languages to create 
native code programs? 


e C# -> x86 instead of C++ -> x86? 


@ System we developed that shows how you 
can use safe languages for native code. 


e MSIL -> x86 / x64 


e Focus is on supporting systems 
programming 


e Consists of: 
e Ahead-of-time compiler 
e Lightweight runtime library (32 KLOC) 


Supporting systems programming 


e Focus is supporting systems programming 
e Bartok runtime library written mostiy in C# 

e Bartok is the compiler and runtime library 
for Singularity 

e Singularity is a research OS written almost 
entirely in C# 
e See hip //sinaularıy for more information 


® Overview of Bartok 
e Performance results 
® Conclusion 
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@ Optimizing ahead-of-time compiler 
e Supports a range of compilation modes 
e Per-function, per-assembly, whole program 
e Optimizations include 
e Advanced interprocedural opts 
e “Textbook” per-procedure optimizations 
e Inlining 
e Non-goal: product quality code generation 


Interprocedural optimizations 


e Optimizations that cross methods 
e Class to whole program in scope 
e Affects patching story 
e Patching/opt granularity must be same 
® Benefits: 
e Eliminate redundant safety checks 
e Null checks, type casts, bounds checks 
e Improve OO programs 
e Devirtualization, tree shaking 


Interprocedural optimizations 


@ Bartok optimizations not usually found in 
production compilers 
C++: 
e Safety check optimizations not done because 
e Programs are unsafe (no bounds checks) 
® Programmers do optimizations by hand (static_cast) 
e OO optimizations developed for type-safe 


settings. 
e Lack of language type safety complicates 
implementing them 


@ C#: too costly to do during just-in-time 
compilation 


Modular, lightweight runtime 
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Code size: 


Garbage Virtual table and 
GCs: 20K lines collection 


Threading: 2K lines 


Type tests: 600 Ines NNFARASAIAGENd 
ie BE 


objectlayout 


VTables, object 

P/invoke: 500 lines Core aati 
Core datatypes: SK floating point 
Jines handling arrays. strings. 


Total: ~32K lines 


This is a research prototype, not a product- 
quality system. 

e Not as rich an environment as CLR 
e Porting .NET .libraries on demand 

e DLLs not implemented 

@ No COM interop 

e No CAS 

@ Minimal support for reflection 


e Overview of Bartok 
e Performance results 
e Conclusion 
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Tree shaking code size reductions fi 
ingularity_ components; 


Code Code w/ 

Program Whole Tree Shake % Reduction 
Kernel 2.37 MB 1.29 MB 46% 
IDE Disk Driver 1.85 MB 455 KB 75% 
Web Server 2.73MB 765 KB 72% 
Content Extension 2.14 MB 502 KB 77% 


Singularity Windows FreeBSD53 Linux 2.6.11 


XP (SP2) (Red Hat FC4) 

C- static lib 54K 232K 664K 

C++ - static lib 572K 70K 1,216K 
C#- wi GC 408K 3,750K 
(CLR) 


© CH process w/ GC can have similar memory footprint to 
C++ w/ heap 


e Phoenix is Microsoft's new compiler and 
programming tools infrastructure 


® Bartok compiled managed Phoenix 
backend 


e Phoenix compiles to 100% native code or 
100% managed code 


synthesized code) 


e Resulting backend can compile NT kernel, -2600 
regress suite files 


@ Performance of Phoenix backend, averaged over 
the -2600 regress files 
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Technology transfe 


e Target for compiler is Phoenix 


e Same optimization technology for C++/native 
and C#/native 


e Plus additional Bartok opts 


e DevDiv and MSR working together to 
integrate Bartok compiler w/ Phoenix 


ə Goal: support incubation efforts 
e Virtual team of 5 devs 


e monn Bartok runtime a . 


e Bartok allows you to write native code in 
safe languages like C#: 


e Optimizing ahead-of-time compiler 
e Modular runtime library 


e Brings benefits of safe languages to 
native code 


Follow up 


e For more information: 
e http://sharepoint/sites/bartok 
e htip:/isinqularity 


e We plan to make a binary drop of Bartok 
available for internal experimentation 


e Email me ifinterested (alias: dtarditi) 


Questions? 


